Uniting Multi-Scale Local Feature Awareness and the Self-Attention Mechanism for Named Entity Recognition

نویسندگان

چکیده

In recent years, a huge amount of text information requires processing to support the diagnosis and treatment diabetes in medical field; therefore, named entity recognition (DNER) is giving rise popularity this research topic within particular field. Although mainstream methods for Chinese can effectively capture global context information, they ignore potential local sentences, hence cannot extract features through an efficient framework. To overcome these challenges, paper constructs corpus proposes RMBC (RoBERTa Multi-scale CNN BiGRU Self-attention CRF) model. This model that unites multi-scale feature awareness self-attention mechanism. first utilizes RoBERTa-wwm encode characters; then, it designs context-wise module, which captures containing locally important by fusing multi-window attention with residual convolution at adds mechanism address restriction bidirectional gated recurrent unit (BiGRU) capturing long-distance dependencies obtain semantic information. Finally, conditional random fields (CRF) are relied on learn dependency between adjacent tags optimal tag sequence. The experimental results our constructed private dataset, termed DNER, along two benchmark datasets, demonstrate effectiveness paper.

برای دانلود باید عضویت طلایی داشته باشید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Extended Feature Set for Chemical Named Entity Recognition and Indexing

The BioCreative IV CHEMDNER Task provides participants with the opportunity to compare their methods for chemical named entity recognition (NER) and indexing in a controlled environment. We contributed to this task with our previous conditional random field based system [1] extended by a number of novel general and domain-specific features. For the latter, we used features derived from two exis...

متن کامل

Named Entity Recognition in Persian Text using Deep Learning

Named entities recognition is a fundamental task in the field of natural language processing. It is also known as a subset of information extraction. The process of recognizing named entities aims at finding proper nouns in the text and classifying them into predetermined classes such as names of people, organizations, and places. In this paper, we propose a named entity recognizer which benefi...

متن کامل

Simplified Feature Set for Arabic Named Entity Recognition

This paper introduces simplified yet effective features that can robustly identify named entities in Arabic text without the need for morphological or syntactic analysis or gazetteers. A CRF sequence labeling model is trained on features that primarily use character n-gram of leading and trailing letters in words and word n-grams. The proposed features help overcome some of the morphological an...

متن کامل

Exploiting Feature Hierarchy for Transfer Learning in Named Entity Recognition

We present a novel hierarchical prior structure for supervised transfer learning in named entity recognition, motivated by the common structure of feature spaces for this task across natural language data sets. The problem of transfer learning, where information gained in one learning task is used to improve performance in another related task, is an important new area of research. In the subpr...

متن کامل

Feature Subset Selection Using Genetic Algorithm for Named Entity Recognition

In this paper, genetic algorithm (GA) is utilized to search for the appropriate feature combination for constructing a maximum entropy (ME) based classifier for named entity recognition (NER). Features are encoded in the chromosomes. The ME classifier is evaluated for the 3-fold cross validation with the features, encoded in a particular chromosome, and its average F-measure value is used as th...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Mathematics

سال: 2023

ISSN: ['2227-7390']

DOI: https://doi.org/10.3390/math11112412